SNER-CS: Self-training Named Entity Recognition in Computer Science
نویسندگان
چکیده
Abstract As the number of scientific publications grows, especially in computer science domain (CS), it is important to extract entities from a large CS publications. Distantly supervised methods, generating distantly annotated training data by string match with external dictionary automatically, have been widely used named entity recognition task. However, there are two challenges use methods NER One that more and new tasks, datasets proposed rapidly, which makes difficult build knowledge base high coverage. The other noisy annotation, because no uniform representation standard domain. To alleviate problems above, we propose novel self-training method based pretraining language model label automatic construction system (SNER-CS). Experimental results show SNER-CS performs previous state-of-the-art
منابع مشابه
Named Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملSelf-training and Co-training Applied to Spanish Named Entity Recognition
The paper discusses the usage of unlabeled data for Spanish Named Entity Recognition. Two techniques have been used: selftraining for detecting the entities in the text and co-training for classifying these already detected entities. We introduce a new co-training algorithm, which applies voting techniques in order to decide which unlabeled example should be added into the training set at each ...
متن کاملArabic Named Entity Recognition
Stemming is the process of reducing words to their stems or roots. Due to the morphological richness and complexity of the Arabic language, stemming is an essential part of most Natural Language Processing (NLP) tasks for this language. In this paper, we study the impact of different stemming approaches on the Named Entity Recognition (NER) task for Arabic and explore the merits, limitations an...
متن کاملNamed Entity Recognition Approaches
Recognizing and extracting exact name entities, like Persons, Locations and Organizations are very useful to mining information from text. Learning to extract names in natural language text is called Named Entity Recognition (NER) task. Proper named entity recognition and extraction is important to solve most problems in hot research area such as Question Answering and Summarization Systems, In...
متن کاملNamed Entity Recognition in Estonian
The task of Named Entity Recognition (NER) is to identify in text predefined units of information such as person names, organizations and locations. In this work, we address the problem of NER in Estonian using supervised learning approach. We explore common issues related to building a NER system such as the usage of language-agnostic and languagespecific features, the representation of named ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of physics
سال: 2023
ISSN: ['0022-3700', '1747-3721', '0368-3508', '1747-3713']
DOI: https://doi.org/10.1088/1742-6596/2506/1/012007